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MARTINGALE APPROXIMATIONS FOR SUMS OF STATIONARY 
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By Wei Biao Wu and Michael Woodroofe 
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Approximations to sums of stationary and ergodic sequences by 
martingales are investigated. Necessary and sufficient conditions for 
such sums to be asymptotically normal conditionally given the past 
up to time are obtained. It is first shown that a martingale approx- 
imation is necessary for such normality and then that the sums are 
asymptotically normal if and only if the approximating martingales 
satisfy a Lindeberg-Feller condition. Using the explicit construction 
of the approximating martingales, a central limit theorem is derived 
for the sample means of linear processes. The conditions are not suf- 
ficient for the functional version of the central limit theorem. This is 
shown by an example, and a slightly stronger sufficient condition is 
given. 

1. Introduction. The central limit problem for sums of stationary and 
ergodic processes has attracted continuing interest for over half a century, 
and two major lines of inquiry have developed. Under conditions of weak 
dependence such as strong mixing, blocking techniques have proved effective. 
Ibragimov (1962) provides an early account of this line. See Doukhan (1999) 
and Peligrad (1996) for more recent ones. An alternative approach, due 
to Gordin (1969), uses martingale approximation to establish asymptotic 
normality; see also Gordin and Lifsic (1978). Ho and Hsing (1997), Maxwell 
and Woodroofe (2000) and Wu and Woodroofe (2000) have followed this line 
recently. Here we come down on the side of martingale approximations by 
showing that if the partial sums of a stationary process are asymptotically 
normal in a suitable sense, then the martingale structure is present and 
the result could have obtained by using it. In addition, we sharpen the 
result of Maxwell and Woodroofe (2000), so that the necessary and sufficient 
conditions meet. 
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It is convenient to address the problem using the following notation. Let 
(X n ) n( zz be a stationary and ergodic Markov chain with values in the state 
space X, and consider additive functionals 

n 

(1) S n = S n (g)=J29(Xi), 

i=l 

where g : X — > M. is a measurable function for which q{Xq) has mean and fi- 
nite variance. The partial sums of any stationary and ergodic process (£ n )nez 
may be written in this form by letting X n = ( . . . , £ n -i,£ n ) and g{X n ) = £ n . 
Let 7T denote the marginal distribution of Xq ; suppose that there is a regular 
conditional distribution for X\ given Xq, say Q(x;B) = P{X\ G B\Xq = x); 
and write Qh(x) = E[h(Xi)\X = x] a.e. (ir) for h G L l {n). 
Let a 2 . = E(S 2 ) and suppose throughout the paper that 

(2) a\ - oo 

as n — > oo. This condition is needed to avoid degeneracy since otherwise 
there exists a stationary sequence Y n such that g{X n ) = Y n — Y n -\ [Theo- 
rem 18.2.2, Ibragimov and Linnik (1971)]. It will not be repeated in state- 
ments of lemmas and theorems. Consider a doubly indexed sequence D n j 
of random variables for which D n j, j = 1,2,..., are martingale differences 
with respect to the filter Tj = a(. . . ,Xj—i,Xj) for each n; and let M n k = 
D n \ + • • • + D n k, so that M n fc, k = 1, 2, . . . , is a martingale with respect to 
J-k for each n. The D n j or M n k is called a martingale approximation (to S n ) 
if 

(3) m<xxE[(S k -M nk ) 2 ]=o(o- 2 n ). 

k<n 

A martingale approximation is called stationary if D n j, j = 1, 2, . . . , is a sta- 
tionary sequence for each n, and nontriangular if D n j = D, are independent 
of n. It is shown below that the existence of a stationary martingale ap- 
proximation is equivalent to the existence of a nontriangular one. When (3) 
holds, asymptotic normality of S n /a n is equivalent to asymptotic normality 
of M nn /a n , and this question may be addressed using the martingale central 
limit theorem [see, e.g., Billingsley (1995), pages 475-478]. 

It is shown in Section 2 that a simple growth condition on E[E(S n \Xo) 2 ] 
is necessary and sufficient for the existence of a martingale approximation. 
Then, in Section 3, it is shown that S n /o~ n is asymptotically standard nor- 
mal, conditionally given Xq, iff the approximating martingales satisfy the 
conditions of the martingale central limit theorem. These conditions are 
not sufficient for the functional version of the central limit theorem. This is 
shown by example in Section 4, and a set of sufficient conditions is developed 
there. 
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Dedecker and Merlevede (2002) have used blocking techniques to ob- 
tain necessary and sufficient conditions for conditional asymptotic normal- 
ity without assuming that the process is strongly mixing, or even ergodic. 
One of their conditions is closely related to (4), but their conditions do not 
include the existence of a martingale approximation and their uniform inte- 
gr ability condition for S 2 /n looks quite different from our Lindeb erg-Feller 
conditions, (11) and (12). Using the explicit construction of martingales, we 
are able to obtain novel asymptotic theory for the sample means of linear 
processes, important and widely used stationary processes. 

2. Martingale approximations. Below, || • || denotes the norm in an L 2 
space, which may vary from one use to the next. For example, || • || denotes 
the norm in L 2 (P) in (4), and the norm in L 2 (tt) in (5). 

Lemma 1. // 

(4) \\E(S n \X )\\=o(a n ), 

as n — > oo, then there is a slowly varying function t for which a 2 = n£(n). 

Proof. If relation (4) holds, then \E[S n (S n+m - S n )]\ = \E[S n E(S n+m - 
S n \X n )}\ < \\S n \\ x H^S'm.lXo)!! < £m,o~mO~ni where £ m — > as 771 — > 00. The 
lemma then follows from Ibragimov and Linnik [(1971), Theorem 18.2.3 and 
the Remark on page 330], after correcting for obvious typographical errors. 

□ 

Relation (4) is crucial in what follows. Since ||-E[g(Xfc)| Xo] || = ||Q fe <7||, it 
is implied by the condition, ££=i \\E[g(X k )\X }\\ = £)JJ =1 \\Q k g\\ = o(a n ), on 
the individual summands; but (4) is weaker and not unintuitive. 

Recall that the equation h = Qh + g is called Poisson's equation. Below, 
we will call a sequence h n G L 2 (ir) an approximate solution to Poissorts 
equation {for g) if 

(5) \\h n \\ + n\\{I - Q)h n - g\\ = o(a n ) 

as n — » 00. Also, if a n and b n are positive sequences, then a n ~ b n iff limn^oo a n /b n = 
1. 

Theorem 1. The following are equivalent: 

(i) Relation (4) holds. 

(ii) There is an approximate solution to Poissorfs equation (5). 
(hi) There is a stationary martingale approximation (3). 

(iv) There is a nontriangular martingale approximation. 
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In this case E{D^ ll ) ~ £(n) for any stationary martingale approximation; 
and there is a stationary martingale approximation for which maxj.<„ \\S k — 
M nk \\<3max k < n \\E(S k \X )\\. ' 

PROOF. It will be shown first that (i) (ii) (iii) (i) and then that 
(iii) (iv) =^ (i). The remainder of the proof is placed between the two 
equivalences. 

(i) => (ii). If (4) holds, let h° = g + Qg + ■ ■ ■ + Q n ~ l g. Then h° n (x) = 
E(S n \X\ = x) and Qh^(x) = E(S n \Xo = x) for a.e. x. Clearly, h° n = g + 
Qh° n -Q n g,tmd\\h° n -Qh° n \\<2\\g\\. So, \\h°J < \\E(S n \X )\\+2\\g\\ = o(a n ), 
by (4). Next, let 

_ h\ + ... + h° n 

n n — 

n 

Then \\h n \\ < maxfc<„ ||/i£.|| = o(cr n ), and h n = g + Qh n — Qh^/n. So, n\\(I — 
Q)h n - g\\ < \\Qh° n \\ = o{cr n ), establishing (5). 

(ii) => (iii). If (5) holds, let f n = g-{I- Q)h n , 

(6) D nk = h n (X k )-Qh n (X k ^ 1 ), 

and M nk = D n \ + • • • + D nk for k = 1, 2, Then D n i,D n 2, . . . are sta- 
tionary martingale differences for each n. Next, writing g(X k ) = h n {X k ) — 
Qh n {X k ) + f n {X k ) in (1) and rearranging terms then leads to S k = M nk + 
S k (fn) + Qh n (X ) - Qh n (X k ). So, 

max||Sfc - M nk \\ < n\\f n \\ +2\\Qh n \\ = o(a n ), 

k<n 

and (3) holds. 

(iii) => (i). If (3) holds, then \\E(S n \X )\\ = \\E(S n - M nn \X )\\ < \\S n - 
M nn \\ = o(a n ). This establishes the equivalence of (i)-(iii). 

For any stationary martingale approximation in (3), nE(D^ ll ) = E(M 2 n ) = 
E(S 2 ) + o(cj^) ~ a 2 ,, so that ^(Z?^) ~ i(n); and for the stationary martin- 
gale approximation constructed in the proof of (i) =4> (iii), m&x. k < n \\S k — 
M nk \\ < n\\f n \\ + 2||Q/i n || < 3max fe < n \\Qh%\\ < 3max fe < n ||E(5 n |X )||. 

(iii) =^ (iv) =^ (i). If there is a stationary martingale approximation, 
then (4) holds and there is a stationary martingale approximation of the form 

(6), say M nk = D nl H V D nk . Then \\M nk - M mk \\ < \\S k - M nk \\ + \\S k - 

M mk \\, and m\\D n \ — D m i\\ 2 = \\M nm — M mm \\ 2 < 2\\S m — M mm \\ 2 + 2\\S m — 
M nm \\ 2 . Let D k = D kk and M n = B x H V D n . Then M 1; M 2 , ... is a mar- 
tingale, \\S n - M n \\ < \\S n - M nn \\ + \\M nn - M n \\, and \\S n - M nn \\ = o(a n ), 
by assumption. Here \\M nn - M n \\ 2 = YZ=x \\ D nk - D kk \\ 2 = Yl=i II Au - 
D kl f. So, 

w .12 ^ 2||5 fc -M fcfc || 2 " 2||Sfc-M nfc || 2 
\\M nn - M n \\ 2 < J2 7 + E r = In + //«, 

fc=l fc A=l fc 
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say. Karamata's theorem [see, e.g., Theorem 0.6 in Resnick (1987)] implies 
that, for a > -1, £?=i %H{%) ~ n 1+Q £(n) / '(1 + a) . Hence J n = o[££ =1 .£(&)] = 
o[n£(n)] = o(cr 2 ). For the second term, notice that ||M n fc|| 2 = A;||L> nl || 2 and 
1 1 -D^i 1 1 2 ~ l{n). Then for some positive C and any positive e < 1/2, 

II n <4 V ll^ll 2 + ll M ^H 2 | o £ ll^-^»fcll 2 

k<ne ^ ne<k<n ^ 

<C £ [£(£;) + ^( n )] + - max ||S fc -M nfc || 2 , 

which by Karamata's theorem implies that limsup.^^^ II n / a 2 < 2Ce and, 
therefore, lim sup n ^ oc II n / a 2 = 0. Conversely, if there is a nontriangular 
martingale approximation, then ||£ , (S' n |A 7 o)|| = \\E(S n — M n \X )\\ <\\S n — 
M n \\ = o(cr n ), as above. □ 



As it is clear from Theorem 1, martingale approximations are not unique. 
Any two are asymptotically equivalent, however, in the following sense: If (3) 
holds, and if M' nk = D' nl + • • • + D' nk is a second martingale approximation, 
then 



(7) E 



max(M^ - M nk ) 

k<n 



n 

< A\\M' nn - M nn f = 4£ \\D' nk - D nk \ 

k=l 



using Doob's [(1953), page 317] inequality, and \\M' nn — M nn \\ < ||5 n — M^ n || + 
\\S n - M nn \\ = o(a n ). 

If £(n) — > oo in Lemma 1, then it is impossible to have a martingale ap- 
proximation that is both nontriangular and stationary, but if <7 2 ~ o~ 2 n, then 
it is. Maxwell and Woodroofe (2000) show that if J2n=i n~ 3 / 2 \\E(S n \X )\\ < oo, 
then there is a martingale M\ , M2 , . . . with stationary increments for which 
\\S n — M n || 2 = o(n). A simplified proof of a special case of this result is 
provided in Lemma 5, along with an explicit bound on \\S n — M n \\. 

The proof of Theorem 1 contains the explicit construction of D nk = 
h n {Xk) — Qh n {Xk-i) in terms of any approximate solution h n to Poisson's 
equation and also an explicit construction of h n . An alternative approximate 
solution to Poisson's equation is provided next. 



Corollary 1. If (4) holds, then (5) holds with h n = f\i n , where 

00 

/*(*) = £(l + e)-'Q>'- 1 $ 



for < e < 1. 
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Proof. From the definition, it is clear that (1 + e)f e = g + Qf £ and 
(I — Q)h n = g — h n /n. So, the corollary would follow from \\h n \\ = o(a n ). 
To see this, first observe that f £ = eJ2h=i(^ + £ )~ k ~ 1 ^ l ki by partial summa- 
tion, where h^x) = E(Sk\Xi = x), as above. Let V(s) = J2T=i a kS k+l - Then 
\\h n \\ = o[V(n/(n + l))]/n by (4), and V(s) ~ ±v^(l - s)- 3 / 2 ^/ 2 (l/(l - a)) 
as s | 1 by Tauberian's theorem [see, e.g., Feller (1971), page 445]. □ 

For some examples, let ... r)-i, rjo, rj\,. . . be a stationary sequence of mar- 
tingale differences with finite variance; and let . . . 9q, 9±, . . . be a se- 
quence of i.i.d. random elements that is independent of . . . T)—i, T]q, rj\, 

Then = [(... O^-i, Ok), ( • • • , %-i> %)] is a stationary Markov chain with 
values in X = N x M N , where O is the range of the 9 k and N is the nonneg- 
ative integers. Let Oj : X — > R be measurable functions for which 

oo 

^^[a J (X ) 2 7 ? 2 ]<oo. 
j=0 

Then 

oo 

(8) 4 = a 3 ( X k-j-i)Vk-j 

3=0 

converges w.p.l for each k and is of the form g{Xk). Processes of the 
form (8) include linear processes with constant ctj and 9k = 0, and are 
called quasi-linear processes below. They also include many nonlinear time 
series models, like autoregressive processes with random coefficients. Writ- 
ing = J2j<k a k-j(Xj_i)r]j and letting b n = ao + • • ■ + a n , it is easily seen 
that 

E(S n \X ) = Y^K-jiXj-!) - b-jiXj-!)]^, 
j<0 
n 

S n - E(S n \X ) =^2b n -j(X j - 1 )r] j . 

3=1 

So ; (J n = cr l,l + °"n,2' with 

oo 

o\ x = \\E(S n \X )f = $>{[& i+n (X ) - b 3 (X )]Wi}, 

3=0 

n-1 

<%,2 = \\Sn ~ E(S n \X Q )\\ 2 = £ Elb^Xofrf], 

3=1 

and (4) is equivalent to cr 2 1 = o(cr 2 2 )- In this case, by (6) in the proof of 
Theorem 1, D nk = b n (X k -i)r]k, where b n = (b H h 6 n _i)/n, by some rou- 
tine calculations, and E[b n (Xo) 2 r]i] must be slowly varying. Observe that if 
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bj(Xo), j < 0, are independent of r/i, then E[b n (Xo)"r]i 
and that cr 2 i and <r 2 2 simplify similarly. 



E[b n {X f]E(r,l 



Example 1 (Linear processes). Suppose that a n are constants and (with- 
out loss of generality) that E(rjj,) = 1. Then b n are also constants, and 



cr 



n.l 



n,2) 



iff 



(9) 



3=0 



n-1 ■ 

k=i . 



oo 
n=0 



a n ^ 0, then <r 2 2 ~ 6 2 n and 



If a n are absolutely summable and b := J2 

there is a C for which cr 2 -l < CJ2f=i YlJLi\ a j\ = °( n )> so that (9) holds. 
Relation (9) also holds if b ^ and 6 n = 6 + 0(1 /n). If ao = and a n = 1/n 



for n > 1, then 6 n ~ log(n), and cr„ 2 ~ ^log 



In this case ex 2 1 



0{n) 



°{ a n 1)1 so that (9) holds. To see this, observe that, for j > 3, + 1) < 

J/ +1 ' 

2 



1 du and [log(j + n) — log(j)] 2 < [log(u + n) — log(u)] 2 tfu, so that 



i=3 



00 / i ^ 



^log 



2 J + n 



J'=3 
roo 



< 



log 2 — 



O(n) 



Similarly, if ao = 0, ai = l/log(2) and a n = 1/ log(n + 1) — l/log(n) for re > 2, 

0[n/log 3 (n)] = o(iT 2 2 ), so that (4) holds. 



then a. 



n.2 



n/log 2 (n) and <7 2 1 



On the other hand, if a n = n @, where 1/2 < /3 < 1, then there are positive 



constants C\$ and c 2;( g for which <r„ j 
that (4) fails. 



~ Ci,/3n 



i 3 2 ^ as n — > oo for i 



1,2, so 



3. Asymptotic normality. The main result of this section is that S* : = 
S n /a n is asymptotically standard normal given Xq, as described below, iff 
there is a martingale approximation, (3), and the D n k satisfy the conditions 
of the martingale central limit theorem, (11) and (12). In more detail, let P x 
and E x denote the regular conditional probability and conditional expecta- 
tion for Too given Xq = x; and let F n denote the conditional distribution 
function 

F n (x;z) = P x (S*<z). 

Further let $ denote the standard normal distribution function; and let 
A denote the Levy distance between two distribution functions. Then by 
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asymptotic normality given Xq, we mean 



(10) 



lim f A[<S>,F n (x;-)]ir{dx} = 0. 



Jx 



Clearly, (10) implies that S* is asymptotically standard normal, but (10) is 
stronger in general; it implies that S* is asymptotically standard normal for 
related models in which Xq has any distribution that is absolutely continu- 
ous with respect to the stationary distribution. Such a property is needed, for 
example, if asymptotic normality is used to set approximate error bounds for 
Markov chain Monte Carlo experiments. See, for example, Tierney (1994). 
Under conditions of weak dependence, (10) can be deduced from (uncon- 
ditional) asymptotic normality of S*. See Proposition 1 for the details and 
the continuation of Example 1 for a case in which S* is (unconditionally) 
normal, but (10) fails. 

Lemma 2. If (10) holds, then (4) holds; that is, \\E(S n \X )\\ =o(a n ). 

Proof. The proof follows Maxwell and Woodroofe (2000), who consid- 
ered the special case <t 2 ~ cn; it is included because the lemma is crucial to 
what follows. Let denote convergence in distribution. Notice that if Z m => 
<E>, then liminfm^oo var(Z m ) > 1, where v&r(Z m ) = E{Z^ n ) — [E(Z m )] 2 . To 
see this, for J > 0, let T m j = min[max(Z m , — J), J]. Then linim^oo var(T mj j) = 
J K {min[max(u, — J), J]} 2 d&(u) — ► 1. By Corollary 4.3.2 in Chow and Te- 

J— *oo 

icher (1978), var(Z m ) > var (T mi j). So liminfm^oo var(Z m ) > 1. 

Assume otherwise that there is a 5 > such that (^(S 1 *, \Xq)\\ > 5 along 
a subsequence {n'}. By (10), there exists a further subsequence {n"} C {n'} 
such that A[$,F n u(x; ■)] — > for almost all x(tt). Let r 2 (x) = var(S*\Xo = 
x). By the result in the previous paragraph, lim inf^/—,^ t 2 « (a;) > 1 for 
almost all x(ir). Thus 1 < limunv/^oo f x T 2 ,,(x)iT{dx) by Fatou's lemma. 
On the other hand, the integral in the previous inequality equals US'*//!! 2 — 
HE^IJyo)!! 2 < 1 - 5 2 , which is a contradiction. □ 

Lemma 3. Suppose there is a martingale approximation {D n k\ for which 



1 



it 



(11) 



n k=l 



and 



1 



n 



(12) 



2 ZZ E ( D lk 1 {\D nk \>ea n }\^k-l) ^ 



" fc=l 
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hold for each e > 0. Then for any martingale approximation {D' nk } (say), 
(11) and (12) are satisfied. In addition, 



(13) 



sup 

0<i<l 



n k<nt 



0. 



Proof. Observe that E\E(D% k \T k . 
and 



E{Dl h \T k ^)\<E\D%-Dl k \ 



E (Pnk i -{\D' nk \>2ea n }\^k-l) 



< 2E(Dl ik l { \ Dnk \> £an }\J : k _i) + 2E(\D% k - D^H^fe-i). 
So, if D nk satisfies (11) and (12), then so do D' nk , since 



\fc=l 



< 



\ k=l 



\Y,W D 'nk-D n k\\ 2 =0{al), 
\ k=l 



as in (7). To establish (13), let m = \nt\, where [^J is the greatest integer 



that does not exceed x; let M' nk = D' nl + • • • + D' nk . Observe that cr^/a. 
as n — ► oo, (11) implies 



m 

~2 E E ( D mk\^k-l) -> *■ 



Since II M' 



nm 
m 



Ml 



a n k =l 

5~ l|A£im — 5*71 



+ ILSU-M' 



o(^n), 



^EKV^l) < 



E W D 'nk+D' m k\ 



\ k=l 

o{vl). 



Ell n' _ n' ||2 



\ fc=l 



Let F„(t) = c-n 2 lZk=\E{E>% k \T k _ x ). Then V^t) - t ^> 0. Let / > 2 be an 
integer. Observe that sup t<1 1 V^(t) — t\ < maxj</ \V n (i/I) — i/I\ + 1/1. By 
first letting n — > oo and then I — ► oo, (13) follows. □ 

Theorem 2. Relation (10) ZioZds iff there is a martingale approximation 
for which (11) and (12) ZioW. 

Proof. Suppose first that there is a martingale approximation (3) for 
which (11) and (12) hold. By Lemma 3, assume without loss of generality 
that the martingale approximation is defined by (6). Then, it suffices to es- 
tablish (10) for all subsequences n r , r > 1, that increase to oo sufficiently fast 
as r — > oo. Observe that D nk , k = 1,2,..., are martingale differences with 
respect to P x for a.e. x(ir) by the Markov property. If n r — > oo sufficiently 



10 



W. B. WU AND M. WOODROOFE 



quickly as r — > oo, then (12) and (13) both hold with convergence in prob- 
ability replaced by convergence w.p.l (P), and lim n _ +00 (5 n — M nn )/a n = 
w.p.l, too. So, these relations hold w.p.l (P x ) for a.e. x(tt). Then, for a.e. 
x(ir), linv^oo F Tlr (x\ z) = &(z) for all z, by the martingale central limit the- 
orem applied conditionally given Xq = x, and (10) holds (along the subse- 
quence) by the bounded convergence theorem. 

The converse will be deduced from Theorem 2 of Ganssler and Haeusler 
(1979), that provides necessary conditions for the functional version of the 
martingale central limit theorem. If (10) holds, then so does (4), by Lemma 2; 
and then there is a stationary martingale approximation, by Theorem 1. So, 
the issues are (11) and (12). Let B denote a standard Brownian motion. 
Then, since the process is stationary and S* is asymptotically normal given 



a, 



s 



,s 



\nt k \ 



S 



=f [m 1 ,m 2 -m 1 ,---,m k -m k _ 1 \ 

for every choice of < t\ < ti < ■ ■ ■ < < 1, where denotes convergence 
in distribution. For example, if k = 2, < s < t < 1, and m = \ nt\ — \ns\ , 
then 



P[S\ns\ < (T n y,Slnt\ ~ S[ns} < <* n A ~ $ 



y 



< 



.X 



J- I! 



Or, 



x\ 



ir{dx} 



which approaches zero as n - 
(14) 



P[S\ na \ <°nV]-<S> 4= 



oo since a m /a n — > \/t — s. Next let 

M*(t) = — E D nk 
° n k<nt 

for < t < 1, and M n (l) = M n ,(l— ). Then the finite-dimensional distribu- 
tions of M n converge to those of B, since \S\ n t\ — M^ nt ^\/a n for each 
< t < 1; and since E[M n (t) 2 } ~ ntE(D^ ll }/a^ l t, it follows that each 
M n (t) 2 , n > 1, is uniformly integrable for each < t < 1. It then follows 
from the martingale inequality that M n is tight in D[0, 1]. So, M n converges 
to B in D[0, 1]; and relations (11) and (12) then follow from Theorem 2 of 
Ganssler and Haeusler (1979). □ 

Example 1 (Continued). For linear processes, relations (11) and (12) 
follow from (4), which implies that D n ^ = b n r\k and that \b n \ is slowly vary- 
ing, for the stationary martingale approximation constructed in the proof of 
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Theorem 1. On the other hand, if a n = n where 1/2 < j3 < 1, then S n /a n 
is asymptotically standard normal, but (4) and (10) do not hold. 

In the next corollary, let 7Ti denote the joint distribution of Xq and X±, so 
that tti(B) = P[(Xq, X\) G B] for measurable B C ,Y 2 ; and let H n (xo,x\) = 
K(xi) - Qh n (x ), so that D nk = H n (X k _i,X k ) in (6). 

Corollary 2. If (4) ZioZds and H n /^l(n) —> H & L 2 (ttx), then (10) 
holds. 

Proof. Let D nk = H n (X k _i,X k ) be the martingale approximation (6) 

and let D' nk = ^njH(X k _ x ,X k ) and M' nk = D' nl H + L>; fc . Then the 

D' nk provide another stationary martingale approximation, since ||M nn — 
M' nn \\ 2 = n\\D nl - D' nl \\ 2 = n\\H n - ^fl(n)H\\ 2 = o(cr 2 ). Moreover, the D' nk 
satisfy (11) and (12). For example, 

\ E E(D' 2 k \X k ) = -J2 S[^(^i,^) 2 |^i]-^P(lo,Ii) 2 ] 

Gn k<nt n k<nt 

by the ergodic theorem; and E[H(X , Xi) 2 ] = \\H\\ 2 = 1, since ||-ff n || 2 
by Lemma 1. Condition (12) may be obtained similarly. □ 

To relate the condition in Corollary 2 to the sums S n , first observe that 
H n / \J l{n) converges in L 2 (tti) iff D n \j 'y '£(n) converges in L 2 (P) and next 
that D n i is the average of E(S k \Xi) — E(S k \Xo) over k = 1, . . . ,n. It is not 
difficult to see that if [EiSnlXi) - E{S n \X Q )\/ y/Kn) converges in L 2 (P), 
then so does D n \j \J £{n). Woodroofe (1992) shows how the condition of 
Corollary 2 can be related to the Fourier coefficients of g when X k is a 
Bernoulli or Lebesgue shift process. 

Example 2. For a quasi-linear process (8), D n \ = b n (Xo)r]i. So, if . . . 77-1, 
770,771, . . . arei.i.d., cr n> i = 0(^,2), and b n /y/£(n) ->• 6 / in L 2 (7r), the D nl /y/£(n) 
converges m L 2 (P) and, therefore, (11) and (12) both hold. 

3.1. Strong mixing processes. Many classical results concerning asymp- 
totic normality for stationary processes require strong mixing conditions; 
see, for example, Peligrad (1986, 1996). Here we show how the strong mix- 
ing assumption is related to our main condition (4). Let X n = (..., £ n _i, £ n ) 

and S n = £1 -I + £ n , where (£i)iez is a stationary sequence that is strong 

mixing; that is, 

a n := sup \P(AC\B) - P{A)P{B)\ 

AeT a ,BeQ n 

as twoo, where T n = er(. . . £n) and Q n = o-(£ n ,£ n+1 , . . . ). 



12 



W. B. WU AND M. WOODROOFE 



Lemma 4. If F and G are two distribution functions and e > 0, then 
there are continuous functions wi, . . . ,w m , depending only on e and G, for 
which \wi\ < 1 and L Wi dG = for all i and 

A(G,F) <e + max / WidF - / WidG 

Proof. The proof consists of first finding a and b for which G(a) + 
1 — G(b) < e, then partitioning [a, b] into a = xo < x\ < ■ ■ ■ < x m = b, where 
Xi — x^i < e/2, constructing piecewise linear functions ui for which Ui(x) = 1 
for x < and Ui(x) = for x > Xj, and then letting Wi = Ui — J R UidG. 
The details are omitted. □ 

Proposition 1. Assume that (£ n )nez is a strong mixing process with 
mean and finite variance. Then S* $ implies (10), and consequently 
(4). 

Proof. By Lemma 4, it suffices to show that 



w(z)F{x; dz] 



n{dx} -» 



as n — > cxd for all continuous w; : M — ► [—1,1] for which L w dQ = 0; and since 
the inner integral is just E x [w(S*)], it suffices to show that £'|£'[u;(S'*)|Xo]| — > 
as n — > oo for all such w. To see this, let m = m n be a sequence for which 
m — > oo and S m /a n 0; and let 5 n = (S'n+m — S m ) / a n . Further, let w : R — > 
[—1,1] be a continuous function for which f K wd& = and let w n (x) = 
E x [w(S*)\ and w n (x) = E x [w(S n )]. Then E[w n (X )\ = E[w(S*)} -»■ 0, since 

5* ^ f?|«; n (Xo) - t5„pr )| < ^k(SJ) " w (Sn)\ -» 0, since S n - 5 * 4 
as n — > oo ; and 

E\w n (X )\ 2 = J w n (X )w(S n ) dP < E[w{S n )) 2 + 4a m -> 0, 

by standard mixing inequalities [see, e.g., Hall and Heyde (1980), page 277]. 
So, E\w n (Xo)\ — > as n — > cxo as required. □ 

4. An invariance principle. Let 

for < i < 1, B n (l) = B n (l— ), where [xj denotes the greatest integer that is 
less than or equal to x. If (10) holds, then the finite-dimensional distributions 
of lB n converge to those of standard Brownian motion IB, and IMI^ converges 
in distribution to IB in the space -D[0, 1], both from the proof of Theorem 2. 
Relations (4) and (10) do not imply that M n converges in distribution to B 
in D[0, 1], however. 
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Example 3. Let G be a symmetric distribution function for which 

1 - G(y) 



y 2 log 3/2 (y) 

as y — ► oo. Let . . . , r)-i,rjo, r]i, . . . ~ $ and . . . , YLi, lo> • • • ~ G be inde- 
pendent random variables. Let clq = 0, a\ = 1/ log(2) and a& = 1/ log(/c + 1) — 
l/log(fc) for > 2, as in Example 1. Define by (8); let £' k = ^ + Y k — Yfc-i; 
and let £» = fi + ■ • ■ + and 5^ = ^ + • • • + Then (4), (11) and (12) 
hold for both SVi and with a\ ~ n/log 2 (n). In this example, 

— max \Yj. — Yq\—>oo 

in probability, so that B n and B^ cannot both converge to B. 

In Theorem 3 and Corollary 3, we consider the special case in which 
a 2 = lim n _ >00 o~ 2 /n exists. These results improve Theorem 2 and Corollary 
4 in Maxwell and Woodroofe (2000) by imposing a weaker condition as 
well as by obtaining a stronger result. The heart of the matter is whether 
there is a martingale approximation for which max/ c < n \ Sk — M nk \/^/n — > 
in probability. This question is addressed first. Two lemmas are needed. 

Lemma 5. Suppose that, for some q > 1, 

(15) \\E(S n \X )\\ = o(V^\og- q n). 

Then there is a martingale Mi , M 2 , . . . with stationary increments for which 
\\S n -M n \\=o{ y /H\og 1 ~ q n). 

PROOF. Recall the construction D nk and M nk = D n \ + • • • + D nk from 
(6) and also that max fc < n \\Sk-M nk \\ < 3max fc <„ ||S(S , fc|A" )||. Thus, max fc < n 
-^nfcll = o[y/n\og~ 9 (n)] in the present context. So, if m > 2 and m<n< 2m, 
then \\M nm - M mm \\ = o[^/m\og~ q (m)]. Since \\M nm - M mm \\ 2 = m\\D n i - 
D m i\\ 2 = m\\H n — H m \\ 2 , it then follows that 
00 00 

(16) £ \\H 2k -H 2k -4 < ^o[log-"(2 fc )] = [log 1 -"(^)]. 

k=j k=j 

It follows that H 2 k has a limit H, say, in L 2 (iri) and that \\H — H m \\ = 

o[log 1 ~ q (m)]. Letting D k = H(X k _i,X k ) and M n = L>iH \-D n , the lemma 

then follows from \\S n — M n \\ < \\S n — M nn \\ + y/n\\H n — H\\. □ 

Lemma 6. Let Y k , k € Z, be a second- order stationary process with mean 
and let T n = Y\ + • • • + Y n . Then 

d 



E 



max T 2 



<d^2 d ^||r 23 || 2 , 
3=0 
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where d = |"log 2 (n)] , the least integer that is greater than or equal to log 2 (n). 

Proof. The proof uses a simple chaining argument and appears in Doob 
[(1953), page 156] for uncorrelated random variables. Briefly, any integer 
k <n may be written as k = 2 ri H \- 2 r i , where < rj < ■ ■ ■ < r\ < d. So, 



j=l 



i = l 

where an empty sum is to be interpreted as 0, and 

d 2 d ~i 

max|r fc | 2 < dJ2 J2 \ T m - r (i-i)2J'| 2 , 

3=0 i=l 

from which the lemma follows by stationarity. □ 

Theorem 3. Let R n = S n — M n , where M n is as in Lemma 5. If g £ LP 
for some p > 2 and (15) holds for q>2, then a 2 = limn^oo o^/n exists, and 



(17) 



lim P 

n— >oo 



maxli?, ! > £\pri 







for each e > 0; and if (15) holds for some q > 5/2, then 



lim P x 

n— >oo 



max \Rj\ > E\fn 



for a.e. x(ir) for each e > 0. 



Proof. Let 7 = 1/4 — l/(2p) > 0, where p is as in the statement of the 
theorem, a = a m = \2 m ~<] , and b = b m = \2 m ^-^] . Then 



max \Rj \ < max 

j<2 m J l<k<b 



\R a k \ + max \R a k+j - R a k\ 

0<j<a 



Here, 



max \R a k+j — R a k\ 

0<j<a J 

< max \M ak+j - M ak \ + maxJS ak+j - S ak \ 

< max \M„u_i_a — M nk \ + a max |ofX,-)l 

~0<i<a' +J ' j<2-»' V J;l 
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for each k. So, 



(18) 



max \ Rj \ > 3ev 2 r 

j<2 m 

< P x i 



IMfc-M,- 



max — 

j<2 m Jl m 



>e 



> 



+ P X 



\Rak\ . 

max — > e 

k<b Jym 



where max* runs over all pairs (j, k) such that 1 < j, k < 2 rn and \k — j\ < a. 
The first term clearly tends to for a.e. x(ir), by the functional martingale 
central limit theorem. The second term in (18) also converges to for a.e. 
x(ir) by the Borel-Cantelli lemma, since 



px 



X 



\9(Xj)\ ^ e 
max — ; > — 

j<2 m Jym a 



ir{dx}<-2 m ^E\g{X x )\^ 



and the right-hand side is summable over m (recalling that a = [2 7m ] and 
observing that P7 + 1 — p < 0). Similarly, for the third term on the right-hand 
side of (18), 



px 



X 



\Rak\ ^ 

max — ■ > e 



ir{dx} = P 



\Rak\ . ' 

max — ■ > £ 

k<b V2™ 



£ z 



ak 



max - 

k<b J2 m 



and, letting d= [log 2 (6)]. 



E 



\R, 



ak\ 



max ■ 

k<b J2 m 



2 id 

<±Y2 d -*\\R a2i f 

— om t-~< 11 az 11 



< 



2 m 
d 

2'" 



i=o 
d 

i=0 



0(02* 



log 2 ("- 1 )(a2 i ) 



2g-3 



0(771 



4-2g 



by Lemmas 5 and 6. Relation (17) follows immediately, since ab = 0(2 m ) and 
d = 0(m); and if q > 5/2, then o{m A ~ 2q ) is summable and P x [maxfc<b \R a k\ > 
e\/2 m ] — > for a.e. cc, by the Borel-Cantelli lemma. □ 



Now let G n and be the distributions of M n and Brownian motion in 
D[0, 1], and let A denote the Prokhorov metric for D[0, 1]. 
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Corollary 3. // (15) holds for some q>2 and < a 2 < oo, then 
lim / A[^,G n (x;-)]ir{dx} = 0; 

n-*oo J x 

and if q > 5/2 in (15), then lim^^oo A[$f, G n {x; ■)]= for a. e. x{ir). 

Proof. Let K n (x; •) be the distribution of M n in D[0, 1]. Then K n (x; 
$ as n -> oo for a.e. x(tt), by the functional central limit theorem, and 



A[*,G n (x;-)]<A[*,K n (x;-)]+P* 



max \Rk\ > eo n 

k<n 



+ e 



for each e > 0. The case q > 5/2 follows immediately, and the case 2 < q < 
5/2 from f x P x [m&x k < n \Rk\ > ea n ]n{dx} = P[max fc < n \R k \ > sa n ]. □ 

Corollary 4. If (15) holds for some q > 2 and a 2 = 0, t/ien maxj.<„ l^fel/ 
y/n-^* 0; and if q > 5/2, f/ien limn^oo P x '[maxfc< n > e-v/n] = for a.e. 
x(ir) for each e > 0. 

PROOF. In this case S k = Rk- □ 

Remark 1. A simple sufficient condition for (15) is 

(19) \\E[g(X n )\X Q }\\=0{n- l / 2 \og~in). 

However, (15) allows processes of the form (8) with a n = n~^(—l) n for n > 1, 
where 1/2 < (3 < 1. In this case (19) is violated. Wu (2002) derived central 
limit theorems for processes of this sort whose covariances are summable 
but not absolutely summable. A typical example is the Gegenbauer process 
which exhibits long-range dependence and has oscillatory covariances [Beran 
(1994)]. 
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